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Abstract. In classical extreme value theory probabilities of extreme events are estimated assuming all 
the components of a random vector to be in a domain of attraction of an extreme value distribution. In 
contrast, the conditional extreme value model assumes a domain of attraction condition on a sub-collection 
of the components of a multivariate random vector. This model has been studied in Heffcrnan and Tawn] 
(2004j), |He"ffernan and Resnick| p007| l, |Das and Resnick] | |2008b| ). In this paper we propose three statistics 
which act as tools to detect this model in a bivariate set-up. In addition, the proposed statistics also help 
to distinguish between two forms of the limit measure that is obtained in the model. 



1. Introduction 



Extreme value theory (henceforth abbreviated EVT) is used to model environmental, financial and internet 
traffic data. Multivariate extreme value theory assumes an extreme- valued domain of attraction condition on 
the joint distribution of a random vector which after a suitable standardization relates multivariate extreme 
value theory to regular variation on M^. Probabilities of critical risk events can be estimated under the 
regularly varying assumption and then related back to the original co-ordinate system. Such estimates are a 
function of the dependence structure among the variables in the multivariate model. There is a substantial 
literature on the behavior of multivariate extreme- value theory both in the presence of asymptotic dependence 
and asymptotic independence (IColes and Tawnl |1991[ ILedford and Tawn[ [19961 [19971 11998[ |de Haan and 



de Ronde 1998 Resnick 2002, Maulik and Rcsnick 2005 dc Haan and Ferreira 



2006) 



A conditional extreme value (CEV) model was proposed in|Heffernan and Tawn ( 2004 1, where multivariate 



distributions were approximated by conditioning on one of the components being in an extreme- value domain. 
The authors showed that this approach incorporated a variety of examples of different types of asymptotic 
dependence and asymptotic independence. Their statistical ideas were given a more mathematical framework 



by Heffernan and Resnick ( 2007 ) after slight changes in assumptions to make the theory more probabilistically 



viable. A further study (Das and Resnick (2008b)) revealed that assuming a conditional extreme value model 



holds no matter what conditioning variable is chosen is equivalent to assuming a multivariate extreme- 
value domain of attraction on the entire random vector. It is known that in the presence of asymptotic 
independence, the limit measure in the multivariate EVT set up has an empty interior; in other words, the 
limit measure concentrates on the boundary of the state space. In such a case an additional assumption of 
a CEV model provides more insight into the dependence structure. The CEV model also provides a way for 
modeling multivariate data assuming a subset rather than the entire vector to be extreme- valued. In this 
paper we suggest situations where a CEV model can be used and suggest techniques to detect the model 
and in the process also detect characteristics of the limit measure for the model. The methodologies are 
suggested for a bivariate data set. 

Section [T] provides an introduction and review of the model. Secti on [2] de als with the detection of a 
conditional extreme value model. It has been shown in Das and Resnick (2008b I that the CEV model can be 



standardized to regular variation on a special cone if and only if the limit measure involved is not a product. 
In case of a product measure in the limit, we need to estimate fewer parameters and calculating probabilities is 



also simpler. Fougeres and Soulier (20081 suggests some estimates for parameters and normalizing constants 
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in the two different cases (product and non-product limit measures). Hence it is important to know whether 
we are in the product case or not. In Section [2] we propose three statistics whose behavior can first of all 
indicate the appropriateness of the CEV model and secondly indicate whether the limit measure is a product 
or not. Section [3] is dedicated to applying our techniques to some simulated and real data coming from 
Internet traffic studies. 

1.1. Preliminaries: The CEV model and some related results. We discuss the model and preliminary 
results in this subsection. Refer to Heffernan and Resnick (20071 and Das and Resnick (2008b I for further 
discussion on conditional extreme value models. 

A univariate extreme value distribution G 7 , for 7 £ R, is defined as 

(1.1) G 7 (x) = exp{-(l + 7 ;r)~ 1/7 }, igEW, 

where E^ = {x S M : 1 + 72; > 0}. For 7 = 0, the distribution function is interpreted as Go(x) = e~ 
E<°) := R. We let E (7) be the right closure of E^; that is, 



E (7) = 



'(-^°o] 



Let (X, Y) be a random vector in 



7>0, 
7 = 0, 
7 < 0. 

I 2 . For a CEV model we make the following assumptions: 



(—00, 00] 



(1) The distribution F(x) :— P[Y < x] is in the domain of attraction of an extreme value distribution, 
Gj, written F £ D(G~ f ). This means there exist functions a(t) > 0, b(t) £ R such that, as t — > 00, 



(1.2) 



t(l-F(a(t)y + b(t)))=tI> 



a{t) 



> V 



for y £ EW. 
(2) There exist functions a(t) > and j3(t) £ I 
[ — 00, 00] x E*' 7 '' such that for each y £ E^ 7 ', 



^(l + 72/)" 1/7 , l + jy>0, 

and a non-null Radon measure /1 on Borel subsets of 



(1.3) 



(1.4) 
(1.5) 
(1.6) 



/P 



1 a(t) " ' a(t) ^ 



as i 



00, 



lb] 
[c] 
[d] 



for {x, y) continuity points of the limit, 

/i([— 00, x] x (y, 00]) is not a degenerate distribution in x, 
/i([ — 00, x] x (y, 00]) < 00. 

H(x) := jtt([— cxd,x] x (0,oo]) is a probability distribution. 



A non-null Radon measure /Lt(-) is said to satisfy the conditional non- degeneracy conditions if both of (1.4 1 
and (1.5| hold. We say that (X,Y) follows a conditional extreme value model (or CEV model) if conditions 
(1) and (2) above are satisfied. The reason for this name is that, assuming (x, 0) to be a continuity point of 
fi, OJ, (Oh and (Oh imply that 



(1.7) 



x - m 

a(t) 



< x 



Y > b(t) -> H(x) = n([-oo, x] x (0, 00]) (t -> 00). 



Note that ( 1.3 1 can be viewed in terms of vague convergence of measures in 



- ([—00, 00] x E*' 7 '') , the space 

of Radon measures on [—00, 00] x E 

In this model the transformation Y 1— > Y* = b*~(Y) standardizes the y-variable, i.e., we can assume 
a*(t) = t,b*(t) = 0. Hence a reformulation of (1.3 1 leads to 



(1.8) 



fP 



X - (3(t) Y* 



00, x] x (y, 00]), as t 
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for (x, y) continuity points of /i* where, 



(1.9) 



ff([-oc,x] x (y,oo]) = 



=])» 



if 7 + 0, 
if 7 = 0. 



fi([-oo,x] x (logy,oo]), 

We also know (|Heffernan and Resnick 2007 Proposition 1) that the following variational property holds: 



there exists functions V^G) such that for all c > 0, 

a(tc) 



lim 



A.N V'l(c), 

c p for some p G 



lim 



(1.10) 

i. — * 

This implies that "01(c) 

either or 02 (c) = fc cP-1 for some /c ^ (de Haan and Ferreira 



/?(*c) ~ /?(*) 
a(t) 



</> 2 (c). 



( de Haan and Ferreira 2006| Theorem B.1.3). Also, ip 2 can be 



2006 



Theorem B.2.1). 



Remark 1.1. The CEV model primarily differs from the multivariate extreme value model in the domain 



of attraction condition. Das and Resnick (2008b I provides conditions under which a CEV model can be 



extended to multivariate extreme value model. Under the multivariate extreme value model, each of the 
variables can be standardized so that we have a multivariate regular variation on the cone [0, oo] \ {0}; see 
de Haan and Resnick (19771 and Chapter 6 of de Haan and Ferreira (20061. The conditional extreme value 



model can be standardized if and only if the limit measure \i in (1.3 1 is not a product measure (Das and 



Resnick 2008b). When both X and Y are standardized, we can characterize the limit measure in terms of 



all Radon measures (finite and infinite) on [0, 1). Though theoretically elegant, performing standardization 
in practice is not an easy task. 

Thus, it is important to know when the limit is a product measure. A product measure in the limit 



precludes standardization of both the variables ( Heffernan and Resnick 2007 1 and means that we do not 



have a multivariate extreme value model (Das and Resnick, 2008b I. However, a product measure makes the 
estimation of certain parameters and probabilities easier. For instance, in the product case (ipi, ^2) = (1)0) 



so p = (see (1.10)) but without the property that \i is a product, p has to be estimated. Furthermore, the 
limit being a product measure can be considered as a form of asymptotic independence in the CEV model, 



which can be probabilistically useful (Maulik et al. 20021 



1.2. Appropriateness of the CEV model. Multivariate extreme value theory provides a rich literature 
on estimation of probabilities of extreme regions containing few or no data points in the sample. The 
multivariate theory assumes that each variable is marginally in an extreme value domain of attraction. 
However, this might not be the right assumption for all data sets. We encounter data where one or some but 
not all the variables can be assumed to be in an extreme value domain; see Section |3.2| The CEV model is 
a candidate model in such cases. 

Another circumstance where the CEV model can be helpful is if one has a multivariate extreme value 
model with limit measure v possessing asymptotic independence. This means that in the standardized 
model, the limit measure, concentrates on the axes through {0} and z/*((0,oo]) = 0. So an estimate 

of the probability of a region where both variables are big will turn out to be zero which may be a useless 
and misleading estimate. In such a circumstance, finer estimates can be obtained using either hidden regular 
variation (iMaulik and Resnick 2005 1 or the CEV model. Both methods provide a non-zero limit measure by 



using normalization functions which are of different order from the multivariate EV model. The relationship 
between the multivariate EV model and the CEV model and the respective normalizing functions considered 
in 



Das and Resnick (2008b) 



So, how do we decide if the CEV model is appropriate for multivariate data? 
(1) Start by checking whether any of the marginal variables belongs to the domain of attraction of an 
extreme value distribution. An informal way to do this is through plots of the estimators of the 
extreme value parameter 7 (Pickands plot, Moment estimator plot, etc). If the plot attains stability 
in some range it is reasonable to assume an extreme-value model. More formal methods for testing 
membership in a domain of attraction using quantile and distribution functions are discussed in 



de Haan and Ferreira (2006), Chapter 5.2. The special case of a heavy-tailed random variable can be 



detected using the QQ plot, plotting the theoretical quantilcs of the exponential distribution versus 
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(2) 
(3) 



the logarithm of the sorted data and checking for linearity in the high values of the data. This is 
reviewed in |Resnick| ( |2007 1 and Das and Resnick| ( |2008a[ ) . 

If some, but not all, marginal variables are in a domain of attraction, proceed to see if the data is 
consistent with the CEV model. See Section [2] 

If all variables are in some extreme value domain, check if the multivariate extreme value model is 
appropriate and if asymptotic independence is present. One way to do this is by checking whether 
both maximum and minimum of the standardized variables have distributions with regularly varying 
tails ( |Coles et al. 1999 Resnick 2002}. If the EV model is appropriate and asymptotic independence 



is absent, the CEV model does not provide any more information than the EV model. On the other 
hand if asymptotic independence is present, the CEV model, if detected, provides supplementary 
information about the joint behavior of the variables away from at least one of the axes. 

2. Three estimators for detecting the CEV model 

Let (Xi, Yi), . . . , (X n , Y n ) be a bivariate random sample. In this section we propose three statistics to 
detect whether our sample is consistent with the CEV model under the assumption that at least one of the 
variables is in an extreme-value domain, and without loss of generality we assume Y to be that variable. 
Our statistics have a consistency property which allows detection of a product form for the limit measure. 

Assume (Xi, Yi), . . . , (X n , Y n ) is iid from a CEV model as defined in Section |1.1| We first formulate a 
consequence of (1.3) which will be convenient for our purpose. The following notations will be used. 

> Y/ n ) The decreasing order statistics of Yi, 



x*, 

Rf -- 



> 



1 < i < n 



Y 

The X- variable corresponding to Y(i), also called the concomitant of Yyy 



E 1{X ; *<A7> 



X*. k < X%, k < 



< X 



k:k 



Rank of X* among X'£ , . . . , X k . We write Ri 
The increasing order statistics of X*, . . . , X k . 



R 



2.1. A consequence for empirical measures. When the CEV property holds, a family of point processes 
of ranks of the sample converge vaguely to a Radon measure. By transforming to ranks of the data, we 
presumably lose efficiency since only the relative ordering in the sample remains unchanged but detection 
of the CEV property is easier since we no longer need to estimate the various parameters of the model. See 



dc Haan and de Ronde ( 1998 1 , 



de Haan and Ferreira (2006), Resnick (2007) 



The convergence statement (1.3 1 of the CEV model defined in Section 1.1 can be interpreted in terms of 
vague convergence of measures, 
notation and concepts. Let E* 



In preparation for the forthcoming result we recall some commonly used 
be a locally compact space with a countable base (for example, a finite 
dimensional Euclidean space). We denote by M + (E*), the non-negative Radon measures on Borel subsets 
of E*. If fi n S M + (E*) for n > 0, then \i n converge vaguely to /io (written \x n —> fio) if for all bounded 
continuous functions / with compact support we have 



fd(J, n 



fd(j, (ra— >oo). 



This concept allows us to write (1.3 1 as 

' ( X-(i(t) Y-b(t) \ 
\ a(t) ' a(t) J 



(2.1) 



tP 



/i(-), as t 



_( 7 ) 

in M_|_([ — oo, oo] x E ). Standard references include Kallenberg (1983), 
Chapter 3). 

Recall the definition of fi* in ( |1.9[ ) and define the measure L(-) € M + ([0, 1] x [l,oo]) by 
(2.2) L([0,x] xG,,oo])=/i*([-oo,fr-(aO] x[y,oo}), (x, y) e [0, 1] x (1, oo]. 

Applying a reciprocal transformation to the second coordinate, 

To ■ (x,y) h-> (x^^ 1 ) 



Ncvcu 



(1977) and 



Resnick 



(2008 
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converts L into the copula L o Tq 1 . 

Proposition 2.1. Suppose {X\, Yi), (X2, 12), • ■ • , (-Xn, Y„) a^e i.i.d. observations from a CEV model which 



follows (1.2l-(1.6l and suppose H is continuous. If k — k(n) — > 00, n — > 00 with k/n — > 0, then in 
M+([0,1] x [l,oo]) 

k 
i=l 



Proof. From (2.1 1 and (Resnick 2007| Theorem 5.3(h)), as n, k — > 00 with 

1 - 

T e/ Xj-Pln/k) Yj-Hn/k) \ (') =>■ 



(2.3) 



—(7) 

in M+([— 00, 00] x E ). RecaUYm > Y( 2 ) > ... > Y( n ) are the order statistics of Y\, 
order and ordering the Y's in (2.3 1 allows us to write the equivalent statement 



, Y„ in decreasing 



(2.4) 

Define the measure v-y by 



1 - 



MO- 



^ 7 ((y,oo]nE (7) ) =(i+ 7J/ )-V7 ) „ e e<t), 

and sometimes, here and elsewhere, we sloppily write 1/7(3/, 00]. Taking marginal convergence in (2.3 1, or 
using (1.2 1, we have with that 



1 " 



i=l 



t(7) 



111 M+(E w; . Using an inversion technique flResnick and Starica, 
(|2007[ page 82)), we get 



(1995) 



de Haan and Ferreira 



(2006), 



Resnick 



(2.5) 



%(fc+i)tl) - p t 7 - 1 



=f(7) 



a(n/k) 7 

in D\ ((0, 00] , E ) , the class of left continuous functions on (0, 00] with range E and with finite right limits 
on (0,oo). The convergence in (2.5) being to a non-random function, we can append it to the convergence 
in (|^2| to get the following ( |Billingsley[ |1968[ p.27): 

~ b(n/k) 
a(n/k) 



(/i n , x n (i) . — 



, i=l ^ «(»/*) ' »(»/*) 



(2.6) 



•(mo, 



- 1 



7 



on M+([-oo,oo] x E (7) ) x D; ((0, 00], E l7J )) . 

Let -Dj ((0,oo],E )) be the subfamily of D;((0,oo],E )) consisting of non-increasing functions and 
define 



F (7), 



by 

where 



Ti : M+([-oo,oo] x E V7J ) x £>/ ((0, oo],E™)) 

Ti^TO, #(•)) = 777* 



■([- 



-OO, OO X 



(0,00]) 



777*([— OO, x] X (t, Oo]) — 777([— OO, x] X (x(t 1 ),Oo]), X €E [ — 00,Oo], t G (0, 



This is an a.s. continuous map so apply this to (2.6 1 and 

(2.7) Ti(fi n ,x n ) =>• Ti(fj,, Xoo). 
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For x £ [—00, 00] and y £ (0, 00], the left side of (|2.7| on the set [—00, x] x (y, 00] is 



and since 



iff 



Mn([-°C,x]x( ±L >00] ) 



% - &(n/&) Y"(|- (fe+ i) tl ) - 6(n/fc) 



a(n/k) 



i < (k + l)y" 



a(n/k) 
k+1 



> y, 



the left side of (2.7) on the set [— 00, x] x (y, 00] is 

1 - 



The right side of (2.7 1 on the set [—00,2;] x (y,oo] is 



y 1 ~ 1 

/x([-oo, x] x ( — - — ,00]) = fj,*([-oc,x] x (y, 00]), 



1 - 

- e/5-flwn fe-t-i \ =>■ A** 

K i=1 V e«C»/«0 ' i J 



so we conclude 
(2.8) 



in M + ([— 00,00] x (0,oo]). Recall fi*(-) was defined in p~9| 
Now assuming (x, 1) is a continuity point of fj,* we have 



1 

#„(x) :=- y^e/ ^-o(n/ fc ) fe+i\([-oo, x] x (l,oo]) 



(2.9) =V([-oo,x] x (l,oo]) =: 

or in the topology of weak convergence on PM[— 00,00], the probability measures on [—00,00], 

H n => H. 

Define a map T 2 on M + ([— 00, 00] x [l,oo]) x PM[— 00,00] by 

T 2 (m,G) = m* 

where 

m # ([0, z] x (y, 00]) = m([0, G{z)] x (y, 00]) 

or, for /GC([0,1] x [l,oo]), 

™ # (/) = // f(G(x),y)m(dx,dy). 



This map is continuous at (m, G) provided G is continuous. To see this, let / be continuous on [0, 1] x [1, 00] 
and suppose G„ =>■ G and m n — > m. Then 

f(G n (x),y)m n (dx,dy) - // f(G(x),y)m n (dx,dy) 



< 



f(G n (x),y)m n {dx, dy) - j j f(G(x),y)m„(dx, dy) 

f{G{x),y)m n {dx,dy) - II f(G(x),y)m(dx,dy) 



--I + II. 
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For /, convergence to follows by uniform continuity of / and the fact that G n (x) — > G(x) uniformly in x. 
To verify II — > 0, it suffices to note that f(G(x),y) is continuous with compact support [—00,00] x [l,oo] 
and then use m n — ► m. 

Combine §2^ and (|2j]) to get 

(2.10) 



1 fe 



in M + ([— 00,00] x [l,oo]) x PM[— 00,00]. Apply the transformation T 2 discussed in the previous paragraph. 
The limit at [0, x] x (y, 00] is /1* ([—00, H^(x)] x (y, 00]) = L[0, x] x (y, 00]. The converging sequence can be 
written as 

k 



Finally observe 



The result follows. 



,X*-[3(n/k), _ 1 * 



a{n/k) 



3(»/fc) < Xj'-3(n/fe) 1 



T 



□ 



We propose three statistics that can be used to detect whether or not a CEV model is appropriate, and 
if so, whether the model has a product measure in the limit. 



2.2. The Hillish stat istic, Hillish fejn . The Hill estimator ( |Hill| ( |1975| ), |Mason| ( |1982[ ), |de Haan and Ferreira 
(2006 1, Resnick (2007}) is a popular choice for estimating the tail parameter a of a heavy-tailed distribution 
We say that a distribution function F on K is heavy-tailed with tail parameter a > if 

1 - F(tx) __ a 



(2.11) 



lim 



for x > 0. 



t^oo 1 - F(t) 

If Zi, Z2, ■ ■ ■ , Z n are i.i.d from this distribution F and Z^) > ^(2) > ■ • • > ^(n) are the orders statistics of 
the sample in decreasing order, then the Hill estimator defined as 

1 fe 

Hillfc,„ = - ^2 log 



O, /c/7 



is a weakly consistent estimator of — as fc, n - 
integrate the tail empirical measure and using its consistency. See Resnick and Starica (|1995| or Resnick 



J (k+1) 

• 0. One way to obtain the consistency is to 



fl2007[ p. 81). 

The Hillish statistic, based on the ranks of the sample, converges weakly to a constant limit under 
the CEV model. The name is derived from the similarity of proof of this convergence with that of the 
weak consistency of the Hill estimator. Using the notation defined just prior to Section 2.1, and assuming 
(X, Y) := {(X u Y x ), (X 2 , Y 2 ), . . . , (X n ,Y n )}, the Hillish statistic for (X, Y) is defined as 

Hillish fe ,„(X, Y) := 1o S W. log 



(2.12) 



J 



The following proposition provides a convergence result of the Hillish statistic under conditions on fc. 

Proposition 2.2. Suppose (X\, Y\), (X 2 , Y 2 ), ■ ■ ■ , (X n , Y n ) are i.i.d. observations from a CEV model which 
follows (1.2|-(1.6| and suppose H as defined in (1.6 1 is continuous. Assume that k — k{n) —f 00, n — ► 00 
and k/n — > 0. Then 



(2.13) 



Hillishk n 



H*([-oo,H^(-)] x (y,oo]) — — =: 1^. 
x x y 
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Proof. Proposition |2.1| yields 

1 n 

(2-14) ^E e (^,^)(')^ i (-) 

in M + ([0, 1] x [l,oo]). Rewrite ( |2.14| ) for x > l,y > 1 as 

1 - 

(2.15) /i*([x,oo] x (y,ooj) := - e/_^ ^ ([a;, oo] x (y,oo]) =*> n*([-oo,H^(l/x)] x (y,oo}). 

Observe that 



H*([x,oo] x (y,oo\) 



i i 

k 



dx dy 1 

x y k 

k 



hI \([ar,oo] x (y,oo]) 



a; y 



1 ^ , fe , fc + 1 1 ^ , k k , fc + 1, 
- ^log - log — = - ^ log -(log T + log — ) 



i=l 



(2.16) 



1 , fc /, fc+ Is 1 v^, k 

1 2^ lo s jt: log 7 + ( log jfc 2^ lo § w = mhsh k,n + A k 

i=l 



1=1 
fc 



1=1 



where A k := ( log j^j) j, ^2 1°6 f^0xl = 0asfc^oo. Hence if we show 

i—l 



M*([-co,^(i)]x( y ,cx,])-^, 
X a; y 



l l 



then we are done. For N finite we know that 

N N N N 

dx dy p 



(2.17) 



/4([x,oo] x (y,oo\)- 



i i 



x y 



H*([-oo,H^{l/x)] x (y,oo]) 



l l 



dx dy 
x y ' 



since (2.151 implies that the integrand converges in probability and we can use Pratt's Lemma ( Resnick 1999 



page 164) for the convergence of the integral. Note that as TV — > oo the right hand side in equation (2.171 

oo oo 

converges to J J oo, H*~(~)] X (y, oo])^^. So we need to see what happens outside the compact sets. 

11 ' v 

In particular if we can show that for any S > 0, 

(2.18) 



(2.19) 



lim lim sup P 



and lim lim sup P 



1 

/ oo oo 



/4([a;,oo] x (y,oo]) — — > 5 ] = 
x y 



1 N 



MnO>o°] x (y,oo])^y > (5 j = 



then by a standard converging together theorem (Resnick 2007 Theorem 3.5), we are done. Observe that 



n / // ,*( [;E ,oo]x( y ,oo])^^ = i^log| : (loglVlogA ) 

y j = l 3 



1 JV 



< 



\\ \J2 ( lo § j^)^ £ ( l0g 1 V l0g Jn^ (Cauchy-Schwarz) 



x CV 



AT 



where 



and 
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r 2 



J=l J 

k/N 



j<k/N 
1 



.7 — 1 n 



Hence 



lim sup P 



1 iV 



*/r i / isdxdy ■ 

Li n { [x,oo\ x (y,oo J > 6 

x y 



< limsupP(B fcn x C kn , N > S) 



and applying Fatou's Lemma, this is bounded by 



< P 



2x->* 



(JV->oo). 



This shows (2.191 holds and similarly we can show (2.181 holds, and we are done. 



□ 



Suppose (X, Y) := {(Xi, Yi), (X2, Y2), ■ ■ ■ , (X n , Y n )} is a sample from a CEV limit model with normaliz- 
ing functions a, /3, a, b and variational functions ipi,ip2- Let the standardized limit measure be fx* as defined 
in (1.9). Also H(x) = 00, x] x (l,oo]). Then (—X,Y) is also a sample from a CEV limit model but 

with normalizing functions a = a, (3 = —ft, a = a,b = b and variational functions = ipi,ip2 — ~4'2- In 
this case the standardized limit measure is \x* and it is easy to check that for x G R, y > 0, 

(2.20) Li*([-aa,x] x (y,oo]) = fi*([-x, 00] x (y,oo]). 
We also have for x G K 

(2.21) H(x) := iI*([-oo,x] x (l,oo]) = (j,*([-x,oo\ x (l,oo]) = 1 - H(-x). 
Thus, for < p < 1, we have, 

(2.22) H^(p) = -H^(l-p). 

The following proposition characterizes product measure in terms of limits of the Hillish statistic for both 
(X, Y) and (-X,Y). 



Proposition 2.3. Under the conditions of Proposition 2.2 fi* is a product measure if and only if both 

p p 
Hillishk, n (X ,Y) — ► 1 and Hillishk^ n (—X ,Y) — > 1. 



Proof. Evaluating 7^*, the limit of the Hillish statistic as proposed in Proposition 2.2 leads us to the above 
results. Recall that we assume H is continuous. Define for any c > 0, the family H^ c \-) of distribution 
functions as follows: 

H^(x) := c' 1 fi* ([-00, x] x (c-\oo}) = H{^{c)x + Va(c)) 



where ^1, ?/> 2 are as defined in ( 1.10) and the second equa lity can be obtained by using tc instead of t in the 
CEV model property (1.3| (Heffernan and Resnick 2007 page 543). Note that H^ 1 ' = H according to our 
definition. Now, 

H*([-oo,H^(-)} x (y,oo\) =- x yL i*([-oo, #*""(-)] x (y,oo\) 
x y x 
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(2.23) = - x H(Ml/y)H^(-) + ihiVv))- 

V x 

(1) If /i* is a product measure then p* = H x v>\ where v\ ((x, oo]) = x _1 , x > 0. Similarly, p* = H x v\. 
We know that p* being a product measure is equivalent to = 1, ip2 = 0. Thus = H for any 
c > 0. Thus 

OO OO oo oo 

V= / /^([-oo,^( 1 )]x (y ,oo]) da; ^= / f'-HiH-i 1 ))^ 
J J x ' x y J J y x x y 

11 11 

oo oo oo 

lldxdy ( f 1 ^ 2 



Also 



a; y 



J J yx x y \J x 2 ) 
l l l 

OO OO oo 

y>([-oo,J^(l)] x (»,oo])^^ - | j ^([-^(l 1 -)] x (y.oo]) 

11 11 

OO OO oo 

-//;<l-*(^-i))??-(/i*r-l. 

1 1 1 

(2) Conversely assume that J^. = 7^, = 1. We know that Vi( c ) = cP f° r some peR. Let us consider 
the following cases: 

(a) p = 0. This means Vi = 1 an d ^(c) = fclogc for some fc S R. We will show that k must be 0. 
If k > 0, then 



OO OO 

V= / /M-([-oo,^(i)]x(y,oo])^*= / /lF(fl-(I)-*Iogy))^* 
J J x x y J J y x x J x y 

11 11 

oo oo oo 

K r fi_i_djcdy = ( n dx y = l 

J J yx x y \J x A ) 



l l 

Similarly, we can show 



1 = 1 if fc = 
I u * < and J -» < 

M I > 1 if k < 0. 



> 1 if fe > 
= 1 if k = 
< 1 if fc < 0. 



Thus for 1^* = I-» = 1 to hold, we must have k = 0, which implies ^ 2 = and p* becomes a 
product measure. 

(b) /o 7^ 0. We will show that this is not possible under the assumption 1^* = I-, = 1. For c > 0, 

p 

for some iel. Assume first p > 0. Then (^) p < 1 for y > 1. Therefore, for such y, 

(-)^(-) + -((-r-i)<i/-(l) is f-(-) + ->o 

y x p y x x p 

(2.24) iff x<l/H(--)=:6, 5 > 1. 
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Denote 



X (x,y) := H((-)"H-(-) + -((-)" - 1)), x > l,y > 1. 
V x p y 



Since H is non-decreasing 

- X (x,y) = n*([-oo,H-(h x (y,oo}) = Iff ((i)^(I) + *((!)' - 1)) 
y x y y x p y 



<iH(^(i)) = i.i 
y x x y 



iSx<S,y>l. 



Since 1^* = 1, we have 



1 cfedy 

-X(a:.y) = 1 = 

V x y 



1 1 dx dy 

x y x y 



We claim 1 < 8 < oo, since if <5 is either 1 or oo, then (2.261 and (2.27) imply that x{x, y) 
almost everywhere which means 



y x p y x 



which is impossible for all y > 1 when p > 0. 
From (2.27) we have 



l l 



x 2/ 



= 0, 



that is, 

oo S 



l rl 



X(x,y) 



dx dy 
x y 



1 8 



X(x,y) 



dx dy 
x y 



A 



(say), 



where the integrands are non-negative on both sides using (2.261. Now / = 1 implies that 



l l 

oo oo 



/?([-oo,#^(±)] x{yM)~- 
x x y 



l l 

oo oo 



A T*([ J ff-(l-I),oo]x(y,oo])- d2/ 
x x y 



l l 

oo oo 



1 1 

oo oo 



I(l-/i*([-oo,^(l-I)]x(l )0 o])) 

1 ) + V 2 (i/y) 

V x 
. x .~\dx dy 



dx dy 
x y 



dx dy 
x y 
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Use the transformation z = -^j and the above equation becomes 



1 



1 1 



l l 

oo oo 



1 1 

oo oo 



yz-1 

1 1 

yz-1 
1 1 



1 - X(*,V) 

- - x( z ,y) 



i i 

Therefore we have 

oo oo 



ly 



- x(z,y) 



dz dy 
z y 

dz dy 
z y 

dz dy 

z y 



1 1 



iy 



x(x,y) 



dx dy 
x y 



0. 



1 1 



i i 



yz 



1 . dz dy 

z z y 



(2.29) 



(2.30) 



(2.31) 



Since x(x, y) < ^ if and only if x < S from (2.26 1, we have 

oo S 



l l 

x-ly 



x{x,y) 



dx dy 
x y 



1 1 

x-ly 



X(x,y) 



dx dy 
x y 



11 is 
where the integrands on both sides are non-negative. But referring to (2.281 we have 

oo <5 oo 5 



1 1 



iy 



x(x,y) 



dx dy 1 
x y ~ 5 — 



x(x,y) 



dx dy 
x y 



A 



5-1 



with equality holding only if the integrand is almost everywhere. Similarly we have 



1 1 



iy 



X{x,y) 



dx dy 
x y 



< 



1 



5-1 



1 r 
V 



x{x,y) — 



dx dy 
x y 



5-1 
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with equality holding only if the integrand is almost everywhere. The integrand cannot be 
since it will imply x(x,y) = - almost everywhere meaning p = 0. But our assumption is 
p > 0. Thus with strict inequality holding for both (2.301 and (2.311 we have a contradiction 
in equation (2.291. Thus we cannot have p > 0. 

The case with p < can be proved similarly. Hence the result. 

□ 

This corollary provides a detection technique for the limit measure being a product measure. Given a 
sample of size n, we plot Hillish/j .„ for values of k and then try to see whether it stabilizes close to 1 or not. 
If the statistic is close to another value, this is evidence that the model is applicable but the limit measure 
is not product. 

2.3. The Pickandsish statistic, Pickandsish^ , n (jp). Another way to check the suitability of the CEV 
assumption and to detect a product measure in the limit is to use the Pickandsish statistic which is based 
on ratios of differences of ordered concomitants. The statistic is patterned on the Pickands estimate for the 



parameter of an extreme value distribution (Pickands ( 1975 1, de Haan and Ferreira (2006 page 83), Resnick 



(2007 page 93)). For a fixed k < n, recall that X^. k < ... < X£. k are the order statistics in increasing 
order from X^X^, ■ ■ ■ ,X k , the concomitants of F(i) > ... > Y^)> * ne order statistics in decreasing order 
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from Y\, Y2, . . . , Y n . For notational convenience for s < t write X s:t :— X^ s -^.^ . Now define the Pickandsish 
statistic for < p < 1, 



(2.32) 



Pickandsishfc ; „(p) := 



^-pk-.k pfc/2:fc/2 

Y* Y* 

^pk-.k pfc/2:fc 



Proposition 2.4. Suppose (Xi, Yi), . . . , (X n , Y n ) follows a CEV model. Let < p < 1, Then, as k, n — > 00 
with k/n — > 0, we have 



(2.33) 



Pickandsish^ n (p) 



ff-(p)(i-2")-y> 2 (2) 

H^(p)-H^(p/2) ' 



provided H*~{p) — H*~(p/2) ^ 0. -Here "0i and ^2 are defined in (1.10) and p = log(f/'i(c))/log c. 

Proof. Since H n in (|2.9| is a probability distribution converging to the limit H, we may invert the convergence 



and obtain (Resnick, 2007 Proposition 2.2, page 20), 



for < z < 1 for which is continuous. The convergence of H^(-) translates to 

k 

H^(z) = mi{u e R : H n (u) > z} = m£{u G K : ^ e^x*-/s( n /*)^ [—00, u] > kz} 
_Xf kxVk -0(n/k) 



i=i 



(2.34) 



a(n/k) 



H-(z) 



where X^. k < . . . X k . k are the increasing order statistics of the concomitants X*, . . . X%. 
From (2.341, we have, for < p < 1, if fc, n — * 00 and k/n — ► 0, 



(2.35) 
(2.36) 
(2.37) 



x * P k-.k- P( n l k ) p » 

— - — ^77^ W, 

a(n/k) 

X (p/2)k:k ~ P( U l k ) P 

a(n/k) 

X pk/2:k/2 ~ /?(2n/fc) p 



c(2n/fc) 



^(p/2), 



Also recall from (1.10 1 that 

lim^M 
t— >oo a(t) 



</>i(c) = c", 



/9(tc)-/3(*) 



^2(0), 



*->oo a(f) 

,c p -l 

theorem we have 

x lk± ~ ^; fe /2:fe/2 _ x * P k:k - /?(«/*) ^/2:fc/2 " 0(2n/*O a (2n/fc) /?(2n/fc) - /?(n/fc) 



where ^2 can be either or ^(c) = D^-^— for some D ^ and pel. Now note that using Slutsky's 



a{n/k) 



a(n/k) 



a(2n/k) 



a{n/k) 



a(n/k) 



H^(p)~H^(p)2P-M2), 



and also, 



X *pk:k - X (p/2)k:k _ X *pk:k ~ X * (p/2)k .. k ~ P(n/k) 



a(n/k) a(n/k) 

^H^(p)-H^(p/2). 



i{n/k) 
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Since H*~(p) — H*~(p/2) ^ 0, another use of Slutsky gives us 



Pickandsishfc jn (p) = 



(X; k .. k -X* pk/2 . k/2 )a(n/k) p ff«-(p)(l_2')-^ 2 (2) 



( X pk:k ~ X 



(p/2)k:k 



)a(n/k) 



H^(p)-H^(p/2) 



□ 



Corollary 2.5. Suppose there exists < p\ < p 2 < 1 sucft iftai H*~(pi) < H^(p%), and for i = 1,2, 
H^{pi) — H^(pi/2) ^= 0. TTien under the conditions of Proposition \2.4\ p* * s a product measure if and only 
if 

Pickandsishk,ni.Pi) — f Qj i = 1,2. 
Proof. 1°. Assume that /i* is a product measure. Then (01, ^2) = (1,0), i.e., p — and 02 = 0. Hence 

F^(p)(l - 2") - 2 (2) = ff^(p)(l - 1) - = 0. 



Therefore, provided < p < 1 and H^{p) — H*~(p/2) ^ 0, Proposition 

p 



2.4 



implies Pickandsishfe.„(p) — > 0. 



2°. Conversely, suppose pi < P2 and Pickandsishfc,„(pi) A 0, i = 1,2. Hence 
(2.38) F < -(p 1 )(l-2")-^(2)=0 

(1) Suppose p = which means 0i = 1. Then ( 2.38[ ) implies -02(2) = which implies 02 = 0. This 
means /1* is a product measure. 

(2) Suppose p ^ and ip 2 = 0. Then ( |2.38| ) implies - 2') = 0,i = 1,2. This implies 
H*~(pi) = 0,i= 1,2, a contradiction to H*~(pi) < H^(p 2 ). So this supposition is not possible. 

(3) Suppose and 02(c) = D^f± for D ji 0. Then ( gjg) implies (ff^( K ) + f )(1-2 P ) = 0,i= 1,2. 
This means H^(pi) = = 1,2, a contradiction to H^(pi) < H^(p 2 ). So this supposition is 



not possible. 



□ 



Hence we have that p* is a product measure if for p\ < p 2 we have Pickandsishfc^p,) — > 0,i = 1,2. 

2.4. Kendall's Tau p T (k,n). Classically, Kendall's tau statistic (McNeil et al. ( 2005[ )) is used to measure 
the strength of association between two rankings. We use a slightly modified version of the statistic using 
data pertaining to the k maximum Y- values: Y(i) > . . . > Y^), their concomitants XJ", . . . , Xj* and the ranks 
Ri , . . . , Rk of XI , . . . , X k . The Kendall's tau statistic is 

( 2 -39) pAKn) : = whl) £ " L 

l<«<J<fc 

This statistic can also be used to show the appropriateness of the CEV model and to decide if the limit 
measure is a product. We show that under the CEV model p T (k,n) as defined in (2.391 converges in 
probability to a limiting constant and when the CEV model holds with a product measure, the limit is 0. 

First we prove a lemma on copulas in [0, l] 2 which leads to proving convergence for the statistic p T (k,n). 
Recall that a two dimensional copula is any distribution function defined on [0, l] 2 with uniform marginals 
flMcNeil et al] p05t ). 

Lemma 2.6. Suppose {Coo,C n n > 1} are copulas on [0,1] 2 , Coo is continuous and C n =>■ Coo. Then 



(2.40) 

Proof. Since C n 



C 00 (u 1 v)dC 00 (u,v), (n— >oo). 



C n {u-, v~)dC n {u, v) -> 
[o,i] 2 [o,i] 2 
• Coo, the convergence is uniform, that is, we have 

||C„ - Coo 1 1 : = sup \C n (u, v) - Coo(u,v)\ 



Therefore 



C n (u-,v-)dC n (u,v) 



C 00 (u,v)dC 00 (u,v) 



[0,1] 2 



[0,1] 2 
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< / \C n (u-,v-) - Coo(u, v)\dC n {u,v) + / C 00 (u,v)dC n (u,v) - J C 00 (u,v)dC 00 (u,v) 

[0,1] 2 [0,1] 2 [0,1] 2 

<||C„ - Cdo| | + / Coa(u, v)dC n (u, v) - I C 00 (u,v)dC 00 (u,v) 
-> 0. 



[0,1] 2 



[o,i] 2 



□ 



Remark 2.1. From Lemma 2.6 we get that if {C n ;n > 1} are random probability measures and is 
continuous, then 

(2.41) 



C n (u—,v—)dC n (u,v)—* / C 00 (u,v)dC 0D (u, v) 



[o,i] ; 



[0,1] 2 



Define the following copulas on [0, l] 2 : 



(2.42) 
(2.43) 



k 

C K (x,y) := t {[0,x] X [0,y]), (x,y) € [0,1] : 

i=i fc ,fc 

Cu*(x,y) := Ai*([oo,iJ^(a;)] x [y- x ,oo]). 



Proposition 2.7. Suppose (Xi, Yi), (X 2 , Y2), • • • , (-Xn, ^n) o,re i.i.d. observations from a CEV model which 
follows (1.2l-(1.6l and suppose H defined in (1.6 1 is continuous. Assume that k — k(n) — > 00, n — > 00 and 
k/n — > 0. Then 

(2.44) 



p T (fc,n)-^4 / C ll *(x,y)dC^(x,y) - 1 =: J M « 

[0,1] 2 



// /i* is a product measure, J M « = 



Proof. Proposition 2.1 implies that as k, n — > 00 with fc/n — > 0, for < a; < 1, z > 1 
1 fe 

2 — 1 1 

Therefore, for < y < 1, 

C M .(x,y) := ^ e (& x)([°M x tM) = lE e (S »U[M] x [0,y))+o P (l) 



=> ii*([-oo,H^(x)} x (y 1 ,oo]) = Cf(x,y). 
since H is continuous, and replacing k + 1 by k does not matter in the limit. This shows that Cn n => C p 



From Lemma T2. 61 and Remark 12. II we have 

l l 



1 1 



c ^ n { x --,y-)dCu,'{.x,y) 



C^{x,y)dC^(x,y). 



o o 



o o 



Now note that 



Q* 

o,, — 



C^(x-,y-)dC K (x,y) = ^^.(y- 

2= 1 

i=l 1=1 







k 2 HRi<Ri} 

" l<I<i<t 
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Hence we have as k, n — > oo with k/n — > 0, 

l l 

M*.n) = ^(431-l) + £3i / / C fl .(x,y)dC IM ,(x,y)-l=: J^. 

o o 

If (i* is a product, for < x, y < 1, 

C^(x,y) := /x*([-oo,if^(x)] x [y _1 ,oo]) =jx H(H^(x)) = xy. 

Hence 

11 11 

Cm* y) dc V (x, y) = y y xydxdy = - 
oo oo 
and the result follows. □ 



Proposition |2.7| would detect that a limit is not a product if the statistics stabilizes at a non-zero value. 
We have not been able to prove a limit of implies a product measure and doubt the truth of this statement. 

Remark 2.2. The three statistics provided above each have their own advantages and disadvantages. 

• They are not hard to calculate. 

• For the CEV model we have shown that all these statistics stabilize as k, n ~ > oo with k/n — > 0. 

• The rank-based statistics Hillish and Kendall's tau are smooth in nature as the rank transform 
removes the extremely high or low values. 

• The disadvantage of the Pickandsish statistic is that its plot lacks smoothness and exhibits erratic 
behavior for small data sets. 

• Obtaining distributional properties for these statistics would require further limit conditions on the 
variables, presumably some form of second order behavior. 

3. Examples and applications 

In this section we apply the three estimators proposed in Section [2] to data sets and judge their perfor- 
mances in the various cases. First we deal with simulated data from specific models discussed in |Das and| 



Resnick (2008b). Then we apply our techniques to Internet traffic data. 



3.1. Simulation from known CEV limit models. 

Example 3.1. Let X and Y be independent random variables with X ~ N(0, 1) and Y ~ Pareto(l). Then 
the following convergence holds in M+([— oo, oo] x (0, oo]) (actually it is an equality) 



Y \ 

X, — ) G [— oo, x] x (y, oo] = ^(x)y^ 1 , — oo < x < oo, y > 1 



W t 

where <f> denotes the standard normal distribution function. We have a CEV model here with a = 1, j3 = 
0, a(t) = t, b = 0. The limit measure is a product. Hence, theoretically 

p p p 

Hillishfc in — > 1, p T (fc,ri)^0, Pickandsishfc,„(p) — * 0, < p < 1. 

We simulate a sample of size n — 1000 and plot the above estimators for 1 < k < n. For the Pickandsish 
statistic we have chosen p = 0.5. The simulated data supports the theoretical results stated. 

Example 3.2. Let X and Z be independent Pareto random variables where X ~ Pareto(p) and Z ~ 
Pareto(l — p) with < p < 1. Define Y = X A Z. Then we can check that the following holds in 
M + ([0,oo] x (Q, oo]): For x > y > and t large, 



fP 



y x ~P\yP xp) y 
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0.5 



.!» 



-0.5 



1.5 



m 1 



0.5 



0.5 



-0.5 



-1i . . o 1 ' ' — 1 

500 1000 500 1000 

The range of k The range of k 



500 1000 
The range of k 



Figure 1. Pickandsish / t,„(0.5), Hillish^n and p T (k,n) plotted for Example 



3.1 



Theoretically the values of the limits of Hillish^ n and Pickandsishfc ra (p) are as follows. 

oo v x — l ) 



00 oo 

Hillish fc ,„ ^ [ [ ^([0,H^(-)] x (t/.oo]) — ^ = 

J J % x y J J y 

1 1 x = l y=l 



If y p \ dx dy 



T / x y 



1 - 



1 X — 1 p ,X — l\l/p 



1-p 



1 — p X 

oo 

E 

fc=2 



l-O^ X ' 



dx 



Now for < p < 1 we have, 



Pickandsish/- jn (p) 



P H-(p)(l-2")-^(2) 



1 - 2^ 



H^(p)-H~(p/2) 1 



■ l-p \!/P ' 

a-p/2/ 



For calculating the Kendall's tau statistics observe that from definition we have: 

cv y) = y(i - ^(l - 1)) = 2/ - /""(l - x), 



dC^* (ar, y) = (1 - p)y p dxdy. 



Hence we have 
l l 



o o 



<V (x, y)dC> (ar, y) = (1 - p) J I (y - /"'(l - x))y-Pdxdy 

o o 

= 1 - P 1 
2-p 4' 



Therefore 



p T (k,n) ->• 4 / / C M .(x, y)dC fl *(x,y) - 1 



o o 



4(1 - p) 
2-p 



-2 = - 



2p 
2-p' 



For p = 0.5 and p = 0.5, theoretically we have 



p p p 

Hillishfe.n — » 0.5, p T (k, n) — > — 0.67, Pickandsishfc jn (p) — > — 0.75. 
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We simulate a sample of size n = 1000 with p = 0.5 and plot the three statistics for 1 < k < n. For the 
Pickandsish statistic we have chosen p = 0.5. The graphs are consistent with the obtained theoretical limits. 



.»? o 



1.5 



0.5 



500 1000 
The range of k 



0.5 



-0.5 



500 1000 
The range of k 



500 1000 
The range of k 



Figure 2. Pickandsishfc in (0.5), Hillish^n and p T (k,n) plotted for Example 3.2 



3.2. Internet traffic data. Internet traffic data has often provided scope for heavy-tail modeling. Variables 



such as file size, transmission duration and session length have been observed to be heavy-tailed (Maulik 



et al. 


2002 


Resnick 


2003 



Sarvotham et al. 2005) 



We study a particular data set of GPS-synchronized traces that were recorded at the University of Auck- 
land http://wand.cs.waikato.ac.nz/wits. The raw data contains measurements on packet size, arrival time, 
source and destination IP, port number, Internet protocol, etc. We consider traces corresponding exclusively 
to incoming TCP traffic sent on December 8, 1999, between 3 and 4 p.m. The packets were clustered into 
end-to-end (e2e) sessions which are clusters of packets with the same source and destination IP address such 
that the delay between arrival of two successive packets in a session is at most two seconds. We observe 
three variables {(Si,Li,Ri) :l<i< 54353}: 



Si = size or number of bytes transmitted in a session, 
Li = duration or length of the session, 
S- 

Ri = — or average transfer rate associated with a session. 
Li 

The data have been downloaded and processed into sessions by Luis Lopez Oliveros, Cornell University. 

First we check whether the individual variables are heavy-tailed or not. The Pickands estimator and 
moment estimators are weakly consistent for the extreme value parameter 7 (|de Haan and Ferreira (20061) 
when the distribution of the variable under consideration is in D{G 1 ) as in ( 1.1 1. We plot these estimators 
over 1 < k < n and observe whether they stabilize over an interval. The Pickands plot indicates that the 
Pickands estimates of the extreme value parameter are stable for size and duration but not for the transfer 
rate. The moment plot on the other hand shows that the moment estimate of the extreme-value parameter 
stabilizes for duration but does not do that clearly for either size or transfer rate. Recall that the CEV 
model is applicable if either of the variables is in the domain of attraction of an extreme-value distribution. 
Clearly there is an indication that transfer rate might not be in an extreme value domain. 

Now we turn to the three statistics we have devised in this paper first to detect whether we have a CEV 
model and then to check whether the limit measure is a product. First we consider the pair (i?, S) assuming 
the distribution of S is in D(G 7 ) for some 7 € M. Then we consider the pair (R, L) assuming the distribution 
of L is in D(G\) for some A £ K. 

Observe from Figure [4] that neither of the three statistics stabilizes for the observations (R,S). Hence a 
CEV model might not be the right model to apply. On the other hand for (R,L), all the statistics stabilize 
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Figure 3. Pickands plot and moment estimate plot of the EV parameter for size, duration 
and transfer rate. 

at some point. But it is clear they are not stabilizing at a point to indicate product measure. Hence we have 
evidence to model (transfer rate, duration) as a CEV model with a non-product limit. Note that this also 
indicates that we should be able to standardize to regular variation on [0, oo] x (0, oo]. 

4. Conclusion 

The CEV model is intended to provide us with a deeper understanding of multivariate distributions which 
have some components in an extreme-value domain. In our discussion, we have provided statistics to detect 
the CEV model in a bivariate set up. These three statistics perform differently for different data sets as we 
have noted in our examples. A further step would be to find asymptotic distributions for these statistics. On 
another direction, it would be nice to obtain statistics for detection of conditional models in a multivariate 
set up of dimension more than two. 
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